Pagoda: A Model for Autonomous Learning in Probabilistic Domains

نویسنده

  • Marie desJardins
چکیده

ley as a test bed for designing intelligent agents. The system consists of an overall agent architecture and five components within the architecture. The five components are (1) goaldirected learning (GDL), a decisiontheoretic method for selecting learning goals; (2) probabilistic bias evaluation (PBE), a technique for using probabilistic background knowledge to select learning biases for the learning goals; (3) uniquely predictive theories (UPTs) and probability computation using independence (PCI), a probabilistic representation and Bayesian inference method for the agent’s theories; (4) a probabilistic learning component, consisting of a heuristic search algorithm and a Bayesian method for evaluating proposed theories; and (5) a decision-theoretic probabilistic planner, which searches through the probability space defined by the agent’s current theory to select the best action. PAGODA is given as input an initial planning goal (its overall behavioral goal, for example, “maximize utility”) and probabilistic background knowledge about the domain. The agent selects learning goals (features in the world about which predictive theories will be formed) that will maximize the agent’s ability to achieve its planning goal. The learner uses the probabilistic background knowledge to select learning biases for each learning goal and performs a heuristic search through the space of UPTs, using Bayesian techniques to evaluate the theories. The planner uses the best current theory to choose actions that satisfy its planning goal and to generate new learning goals for the inductive learning subsystem to focus on. My Ph.D. dissertation (desJardins 1992a)1 describes PAGODA (probabilistic autonomous goal-directed agent), a model for an intelligent agent that learns autonomously in domains containing uncertainty. The ultimate goal of this line of research is to develop intelligent problem-solving and planning systems that operate in complex domains, largely function autonomously, use whatever knowledge is available to them, and learn from their experience. PAGODA was motivated by two specific requirements: The agent should be capable of operating with minimal intervention from humans, and it should be able to cope with uncertainty (which can be the result of inaccurate sensors, a nondeterministic environment, complexity, or sensory limitations). I argue that the principles of probability theory and decision theory can be used to build rational agents that satisfy these requirements. PAGODA incorporates innovative techniques for using the agent’s existing knowledge to guide and constrain the learning process as well as a powerful new mechanism for representing, reasoning with, and learning probabilistic knowledge. Additionally, PAGODA provides a conceptual framework for addressing important open problems such as incremental, resource-bounded learning and knowledge-based learning and planning. PAGODA was implemented in the RALPH (rational agent with limited processing hardware) world, a simulated two-dimensional world used at the University of California at BerkeGoal-Directed Learning

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

PAGODA: An Integrated Architecture for Autonomous Agents

PAGODA (Probabilistic Autonomous GOalDirected Agent) is an autonomous intelligent agent that explores a novel environment, building a model of the world and using the model to plan its actions [desJardins, 1992b]. PAGODA incorporates solutions to the problems of selecting learning tasks, choosing a learning bias, classifying observations, and performing induetire learning of world models under ...

متن کامل

Representing and Reasoning With Probabilistic Knowledge: A Bayesian Approach

PAGODA (Probabilistic Autonomous GOal­ Directed Agent) is a model for autonomous learning in probabilistic domains [desJ ardins, 1992) that incorporates innovative techniques for using the agent's existing knowledge to guide and constrain the learning process and for representing, reasoning with, and learn­ ing probabilistic knowledge. This paper de­ scribes the probabilistic representation and...

متن کامل

Pagoda: a Model for Autonomous Learning in Probabilistic Domains Pagoda: a Model for Autonomous Learning in Probabilistic Domains

Machine learning approaches have traditionally made strong simplifying assumptions: that a benevolent teacher is available to present and classify instances of a single concept to be learned; that no noise or uncertainty is present in the environment; that a complete and correct domain theory is available; or that a useful language is provided by the designer. Additionally, much existing machin...

متن کامل

Goal-Directed Learning: A Decision-Theoretic Model for Deciding What to Learn Next

This paper describes a theory called Goal-Directed Learning (gdl) that uses the principle of decision theory to choose learning tasks. The expected utility of being able to predict various features of the environment is computed and those with highest expected utility can be used as learning goals, which an agent's inductive mechanism should form theories to predict. We present a general decisi...

متن کامل

Policy-based Coordination in PAGODA: A Case Study

PAGODA (Policy And GOal Based Distributed Autonomy) is a modular architecture for specifying and prototyping autonomous systems. A PAGODA node (agent) interacts with its environment by sensing and affecting, driven by goals to achieve and constrained by policies. A PAGODA system is a collection of PAGODA nodes cooperating to achieve some mutual goal. This paper describes a specification of PAGO...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • AI Magazine

دوره 14  شماره 

صفحات  -

تاریخ انتشار 1993